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NOVEL LEADER SEQUENCES 
FOR USE IN PRODUCTION OF PROTEINS 

FiELD OF THE INVENTION 

This invention relates to leader sequences for production of proteins. I^ore specifically, 
the invention relates to DNA constructs encoding leader sequences comprising an 
immunoglobulin signal peptide fused to a tissue -type plasminogen activator propeptide. The 
Invention ftjrther relates to the use of these DNA constructe for producing proteins In 
mammalian cells. 

BACKGROUND 

1. Processing of protein precursors 

Secreted proteins are expressed initially Inside the cell in a precursor form containing a 
leader sequence ensuring entry into the secretory pathway. Such leader sequences, named 
signal peptides, direct the expressed product across the membrane of the endoplasmic 
reticulum (ER). Signal peptides are generally cleaved off by signal peptidases during 
translocation to the ER. Once entered in the secretory pathway, the protein is transported to 
the Golgl apparatus. From the Golgi the protein can follow different routes that lead to 
compartments such as the cell vacuole or the cell membrane, or It can be routed out of the cell 
to be secreted to the external medium (F>feffer and Rothman (1987) Ann.Rev.Blochem. 
56:829-852). 

For industrial production of a secreted protein, the protein to be produced needs to be 
secreted efficiently from the host cell or the host organism. The signal peptide may be, e.g., 
the native signal peptide of the protein to be produced, a heterologous signal peptide, or a 
hybrid of native and heterologous signal peptide. Numerous signal peptides are used for 
production of secreted proteins. One of them is a murine immunoglobulin signal peptide (IgSP, 
EiWiBL Accession No. iVI13331). IgSP was first identified in 1983 by Loh et ai. (Ceil. 33:85-93). 
IgSP Is Icnown to give a good expression In mammalian cells. For example, EP patent No. 
0382762 discloses a method of producing horseradish peroxidase by constructing a fusion 
polypeptide between IgSP and horseradish peroxidase. 

l-lowever, several problems are encountered with the use of currently known signal 
peptides. One problem often encountered when producing a human protein from a non -human 
host ceil or organism is that the native signal peptide does not ensure efficient translocation 
and/or cleavage of the signal peptide. This leads to low rates of protein secretion and/or to 
secretion of mature proteins that display N -terminal extensions due to an Incorrect cleavage of 
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the signal peptide. Thus the choice of the signal peptide Is of great importance for Industrial 
production of a protein. 
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In addition of leader sequences directing the secretion of the protein, a precursor fbim 
can comprise supplemental leader sequences that are cleaved during maturation. These 
supplemental leader peptides, named propeptides, usually follow the signal peptide. Virtually 
all peptide hormones, numerous bloactive proteins (for example. gro\«th factors, receptore and 
cell-adhesion molecules), and many bacterial toxins and virai envelope glycoproteins 
comprise a propeptide that is post-transiatlonaliy excised to generate the mature and 
biologically active protein (Seidah and Chretien (1999) Brain Res. 848:45-62). 

Propeptides are deaved off by enzymes named proprotein convertases. Mammalian 
proprotein convertases include, e.g.. the subtilisin convertases PCSK1, PCSK2 and furin. 
Furin is ubiquitously expressed and located in the trans -Golgl network. Furin proteolytically 
activates large numbers of proproteins substrates in secretory pathway compartments. 
(Thomas (2002) Nat Rev Mol Cell Biol. 3:753-766). More specifically, furin localizes to the 
Trans Golgi Networtc - a late Golgi structure that is responsible for sorting secretory pathway 
proteins to their final destinations, including the cell surface, endosomes. lysosomes and 
secretory granules. The site that furin cleaves has been extensively studied. The cleavage site 
is positioned after the carboxyl-tenminal arglnine of the consensus sequence R-X-L/R-R. 
wherein X may represent any amino add (Nakayama (1997) Biochem. J 327:625-635). The 
deavage effidency is increased when X Is a lysine, a valine, an isoleudne or an alanine 
(Watanabe et al (1992) J Btol. Chem. 267:8270-8274). 

2. The tissue-type plasminogen activator precursor 

The human tissue-type plasminogen activator precursor (tPA, SwissProt Acession No. 
P00750) is synthesized as a precursor form of 562 amino adds comprising a leader sequence 
of 35 amino adds. This leader sequence comprises a signal peptide of 23 amino acids 
followed by a propeptide of 12 amino adds. 

Kohne et al. (1999, J. CeU. Biochem. 75 :446-461) showed that the tPA leader 
sequence of 35 amino adds was able to rescue Intracellular transport of a chimeric Tumor 

30 Necrosis Factor Receptor - Immunoglobulin protein (TNFR-Ig) in which all N-linked 
glycosylatlon sites had been deleted. In 1999, Etchevenry et al. reported that a leader 
sequence of 13 amino adds, which comprised the last amino acid of the signal peptide and 
the entire propeptide of tPA, was able to enhance secretion of a TNFR-Ig ftjsfon when inserted 
between the TNFR naHve signal peptide and the TNFR-Ig polypeptide (Etcheverry et al., 

35 ESACT meeting, abstract 01 .07/P1 .02). 
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TJius protein processing Is a fundamental process for efficient protein secretion, and tlie 
dioice of the leader sequence is a criticai step wlien producing a secreted polypeptide. In 
many cases, tiie leader sequence leads to a low level of secretion or no secretion at all, or to 
5 an incorrect or incomplete proteolytic processing. It is therefore the object of the present 
Invention to provide leader sequences that ensure a more efficient secretion and/or 
processing of polypeptides. 

SUMMARY OF THE INVENTION 

10 The present invention Is based on the finding that a leader sequence comprising an 

immunoglobulin signal peptide fused to a tissue-type plasminogen activator propeptide allows 
a more efficient secretion and processing of proteins of Interest than other known leader 
sequences. In addition, it has been found that a leader sequence comprising a truncated form 
of the human tPA propeptide, wherein the carboxyl-terminal extremity of the tPA propeptide 

15 consists of amino adds Arg-Xaa-Arg-Arg, allows an efficient secretion and processing of 
proteins of interest 

Therefore, a first aspect of the invention relates to a DNA construct comprising a 
sequence encoding an IgSP-tPApre-propeptide comprising an immunoglobulin signal peptide 
fused to a tPA propeptide. 

20 A second aspect relates to A DNA construct comprising a sequence encoding a human 

tissue-type plasminogen activator propeptide (tPA) wherein the cartsoxyl -tenmlnal extremity of 
said tPA propeptide consists of amino adds Arg-Xaa-Arg-Arg 

A third aspect relates to a host cell transformed with a DNA construct according to the 
invention. 

25 A fourth aspect relates to a process for the production of a polypeptide of interest 

comprising the step of transfeding a host cell with a DNA construct in accordance with the 
Invention. 

A fifth aspect relates to a process for the production of a polypeptide of Interest 
comprising the step of culturing a host ceil of the invention . 

30 A sixth aspect relates to the use of a DIMA oonstrud in accordance with the invention for 

produdng a polypeptide of interest. 

A seventh asped relates to a fusion polypeptide encoded by a DNA construct in 
accordance with the invention. 



BRIEF DESCRIPTION OF THE FIGURES 

>^gure 1 shows an alignment between an IgSP pre -propeptide in accordance with the 
Invention (SEQ ID NO: 1) and the native tPA pre -propeptide (SEQ ID NO: 2). 

P'PIU''e2 shows a scheme of the pGL3-GH-TBP-1380 vector used to construct the different 
5 signal peptide fused to the TBPI protein. 

.PflUf^ 3 shows a scheme of the pEF1-GH-TBP-1403 vector used to express the leader 
peptides-TBPI fusion proteins in transient transfecOon assays. 

F'fl"'^ 4 shows the amount of TBPI protein detected in supemata nt vereus cytoplasm of 
cells transfected with the Indicated constructs. Lane GH_SP: TBPI fused to the Growth 
10 honmone signal peptide; Lane SEAP_SP: TBPI fused to the secreted alkaline phosphatase 
signal peptide; Lane IgSP: TBPI fused to the murine immunoglo bulin IgG p-heavy chain signal 
peptide; Lane IgSP-tPA: TBPI fused to an IgSP preiDropeptlde in accordance with the 
invention. 

f^mure 5 corresponds to a scheme of the CI^V-UbB-LUC-1433 vector used to express the 
15 leader peptides-TBPI fusion proteins in stable transfecHon assays. 

P'flure 6 shows the amont of TBPI protein detected in the supematant of pools of clones 
transfected with IgSP-tPA or with tPA-tPA pre-propeptides fused to TBPI. Pools were 
maintained eitheir in puromycin and neomycin co -selection (neo/puro) or in puromycin minus 
Hypoxantine-Tymidine co-selection (HT/puro). Open box and dark box represent two different 
20 pulses of 48hrs at 37 ^'C in medium with 10% FCS. Stripped or squared box represent two 
pulses of 48hrs at 32 ""C In seaim-free medium. 



BRIEF DE SCRIPTION OF THE SEQUENCES OF THE SEQUENCE LISTING 

SEQ ID NO: 1 conresponds to the protein sequence of an IgSP-tPA pre-propeptide in 
25 accordance with the Invention. 

SEQ ID NO: 2 corresponds to the protein sequence of the human tPA pre -propeptide. 

SEQ ID NO: 3 corresponds to the protein sequence of the murine IgG m -heavy chain signal 

peptide. 

SEQ ID NO: 4 conresponds to the protein sequence of the human growth honmone signal 
30 peptide. 

SEQ ID NO: 5 conresponds to the protein sequence of the human secreted all^llne 
phosphatase signal peptide. 

SEQ ID NO: 6 con-esponds to the nucleic sequence of an igSP-tPA pre-propeptide in 
accordance with the invention. 
35 SEQ ID NO: 7 con-esponds to the nucleic sequence of the human tPA pre -propeptide. 

SEQ ID NO: 8 corresponds to the nucleic sequence of the murine IgG p -heavy chain signal 
peptide. 
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SEQ ID NO: 9 corresponds to the protein sequence of the soluble portion of the TNF receptor 
p55. 

SEQ ID NO: 10 corresponds to the protein sequence of mature interferon gamma recep tor 
chain. 

SEQ ID Nos. 11 to 20 con"espond to primers used to construct and amplify the Ig m -heavy 
chain signal peptides. 

SEQ ID NO: 21 to 34 conrespond to primers used to construct and amplify the human growth 
honnone signal peptide. 

Seq ID NO: 35 to 41 correspond to primers used to construct and amplify the human secreted 
10 alkaline phosphatase signal peptide. 

Seq ID NO: 42 to 49 correspond to primers used to construct and amplify the IgSP-tPA pre- 
propeptide in accordance \Arith this invention. 

Seq ID NO: 50: correspond to the nucleic sequence of the soluble extracellular portion of the 
p55 Tumor necrosis factor. 
15 Seq ID NO: 51 to 54 correspond to primers used to introduce a deletion of three amino -adds 
into the IgSP-tPA and tPA pre-propeptides in accordance with this Invention. 
Seq ID NO: 55 to 58 correspond to primers used to generate the tPA -pre-propeptide. 

DETAILED DESCRIPTION OF THE INVENTION 

20 The present invention stems from the finding that leader sequences comprising an 

Immunoglobulin signal peptide fused to a tissue-type plasminogen activator propeptide allow a 
more efficient secretion and processing of proteins than other known leader sequences. As 
shown In examples 2 and 3, leader sequences of the present Invention are at least 2 fold more 
efficient in promoting secretion of proteins of interest than prior art leader sequences. 

25 in addition, it has been found that a leader sequence comprising a truncated form of 

the human tPA propeptide, wherein tiie carboxyl-terminal extremity of tiie tPA propeptide 
consists of amino acids Arg-Xaa-Arg-Arg, allows an efficient secretion and processing of 
proteins of Interest. 

Accordingly, the present invention provides novel leader sequences comprising (i) an 
30 immunoglobulin signal peptide fused to a tissue -type plasminogen activator propeptide; or (11) 
a tPA propeptide wherein its carijoxyl -terminal extremity consists of amino acids Arg-Xaa-Arg- 
Arg. The use of these leader sequences for producing proteins of interest In mammalian cells 
is a furUier aspect of the present Invention. 

35 A first aspect of the present invention relates to a DNA construct comprising a 

sequence encoding an IgSP-tPA pre-propeptide comprising an immunoglobulin signal peptide 
fused to a tissue-type plasminogen activator propeptide. 
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As used herein, the term "signal peptide" refers to a leader sequence ensuring entry 
Into the secretory pathway. As used herein, the term " propeptide " refers to a leader sequence 
that follows a signal peptide. As used herein, the term " pre-oropeptide" refers to a leader 
sequence comprising a signal peptide and a propeptide. As used herein, the term leader 
sequence" refers to a sequence located at the amino terminal end of the precursor form of a 
protein. Leader sequences are cleaved off during maturation. 

As further used herein, the temi "loSP-tPA pre-Dmp«>ptiri>." refers to a leader sequence 
according to the present Invention that comprises an Immunoglobulin signal peptide fused to a 
tissue-type plasminogen activator propeptide. 

As shown in example 1. IgSP-tPA pre-propeptldes ensures a more efficient secretion 
of the soluble portion of the TNF receptor p55 (TBPI) than various signal peptides fused 
directly to TBPI. Example 2 demonstrates that the IgSP-tPA pre-propeptide is more efficient In 
promoting TBPI secretion than a tPA pre-propeptlde alone. Example 3 demonstrates that the 
IgSP-tPA pre-propeptlde is more efficient in promoting secretion of the mature Interferon 
15 gamma receptor chain (IFNAR) tiian the native IFNAR signal peptide. Accordingly, IgSP -tPA 
pre-propeptides ensure a more efficient secretion of polypeptides of interest tiian any known 
leader sequence. 

Numerous immunoglobulin (ig) signal peptides from different species are known and 
are all encompassed within the scope of the present invention. In a prefen-ed embodiment, tfie 
20 Ig signal peptide Is a murine Immunoglobulin signal peptide. Preferably, ttie murine Ig signal 
peptkie is a murine IgG p-heavy chain signal peptide of SEQ ID NO: 3. 

The DNA consb^jct of the present invention comprises a sequence encoding a tissue- 
type plasminogen activator (tPA) propeptide of any origin. For example, the DNA construct 
can comprise a sequence encoding a tPA propeptide of human, murine, rat or bovin origin. 
25 Preferably, the DNA construct comprises a sequence encoding a tPA propeptide of hu man 
origin. 

In a further embodiment, ttie DNA construct of tiie present invention comprises a 
sequence encoding a human tPA propeptide, the carboxyl -terminal extremity of said tPA 
propeptide consisting of amino adds Arg-Xaa-Arg-Arg. Such a propeptide corresponds to a 
30 truncated propeptide lacking ttie ttiree carboxyl -tenninal amino acids of ttie native human tPA 
propeptide (see Figure 1). Preferably, ttie human tPA propeptide consists of amino adds 24 to 
32 of SEQ ID NO: 2. 

In a preferred embodiment, ttie DNA con struct of the present Invention encodes a pre- 
propeptide comprising SEQ ID NO: 1. In a most prefeired embodiment, tiie pre-propeptide 
35 consists of SEQ ID NO: 1. 
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The terms " comprising" , " oonslstlna of" , or "consisting e ssentially of" have distinct 
meanings. However, each term may be substituted for another herein to change the scope of 
the invention. 

As used herein, the expression "a polypeptide A fused to a polypeptide B" refers to a 
fusion polypeptide comprising the sequences of polypeptides A and B, wherein the sequence 
of polypeptide A is located at the amino -terminal extremity of the sequence of polypeptide B 
within said fusion polypeptide. The term 'Hised to", as used herein. Is not limited to a direct 
fusion of polypeptides. For example, the cloning strategy may lead to the presence of amino 
acids between polypeptides A and B. However, a direct fusion of polypeptides is preferred. 
Methods of constructing DNA constructs comprising sequences encoding fusion polypeptides 
are well ioiown In the art. For example, methods described in examples 1 to 3 may be used. 

Another piBferred embodiment of the present invention relates to a DNA construct 
encoding a fusion polypeptide comprising an IgSP-tPA pre-propeptide fused to a polypeptide 
of interest 

In accordance with the present invention, the polvpeptlde of interest may be any 
polypeptide for which production is desired. For example, the polypeptide of interest may be, 
e.g., a naturally secreted protein, a nomially cytoplasmic protein, a normally transmembrane 
protein, or a human or a humanized antibody. When the protein of interest Is a normally 
cytoplasmic or a normally transmembrane protein, the protein has preferably be&y engineered 
20 in order to become soluble. The polypeptide of interest may be of any origin. Preferred 
polypeptides of interest are of human origin. 

In preferred embodiments, tiie polypeptide Is selected from the group consisting of 
chorionic gonadotropin, follicle-stimulating hormone, iutropin-choriogonadotropic hormone, 
thyroid stimulating hormone, human growth hormone, interferons (e-g., interferon beta-1a, 
25 Interferon beta-1b), interferon receptors (e.g., interiieron gamma receptor), TNF receptors p55 
and p65, Interteuklns (e.g., lnterieuldn-2, lnterieukln-11), Interieuldn binding proteins (e.g.. 
interieukin-18 binding protein). anti-CD1 1a antibodies, and mutelns, fragments, soluble forms, 
functional derivatives, fusion proteins thereof. 

Other preferred polypeptides of interest include, e.g., erytiiropoietin. granuloc^e 
30 colony stimulating factor, granulocyte-macrophage colony-stimulating factor, pituitary peptide 
hormones, menopausal gonadotropin, insulin-like growth factors (e.g., somatomedin -C), 
keratinocyte growtii factor, glial ceil line-derived neurotrophic factor. thromt>omodulin, basic 
fibroblast growth factor, insulin, Factor VIII, somatropin, bone morphogenetic protein -2, 
platelet-derived growth factor, hirudin, epoietin. recombinant LFA-3/lgG1 fusion protein. 
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glucocerebrosidase, and muteins, fragments, soluble forms, functional derivatives, fusion 
proteins thereof. 
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A second aspect of the present invention is directed to a DNA construct comprising a 
sequence encoding a human tissue-type plasminogen activator propeptide characterized in 
that its carboxyl-terminal extremity consists of amino adds Arg-Xaa-Arg-Arg. 

In a first embodiment, the tPA propeptide of the present Invention is a human tPA 
propeptide consisting of amino adds 24 to 32 of SEQ ID NO: 2. 

In a second embodiment, the DNA construct comprising a tPA propeptide in 
accordance with the Invention further comprises a signal sequence fused to said tPA 
propeptide. 

Any signal peptide cun^ntly used in the art for promoting protein secretion may be 
used in the above embodiment. Such signal peptides Indude, e.g., the human growth 
homnone signal peptide (see. e.g.. EP application 01 999 652.9). the secretion competent 
polypeptide disclosed in EP application 00 906 103.7, tiie human erythropoietin signal 
peptide, the human albumin signal peptide, tiie human secreted alkaline phosphatase signal 
peptide and the rotavirus VP7 glycoprotein signal peptide. 

in a third embodiment, the DNA comprising a tPA propeptide in accordance witii the 
invention encodes a fusion polypeptide comprising said tPA propeptide fused to a polypeptide 



20 of Interest. 



In a prefen-ed embodiment, ttie DNA constoict comprising a sequence encoding an 
IgSP-tPA pre-propeptide and/or a tPA propeptide in accordance witii the invention is Included 
In a vector. 

25 The temn "vector" refers to any earner of exogenous DNA or RNA that Is useful for 

transferring exogenous DNA to a host cell for replication and/or appropriate expression of the 
exogenous DI^ by the host cell. 

In a further preferred embodiment, the vector is an expression vector. An " expression 
vector" comprises appropriate signals that drive expression in host ceils of a polynudeotide 

30 inserted in said vector. Preferably, ihe polynudeotides Inserted in said vector encode a 
polypeptide of interest The appropriate signals indude various regulatory elements, such as 
enhancers and/or promoters from botii viral and mammalian sources. Selectable maricere for 
establishing pemnanent. stable cell dones expressing the products such as, e.g., a dominant 
drug selection, are generally Included in ttie expression vectors of ttie invention, as tiiey are 

35 elements that link expression of the dmg selection maricers to expression of the polypeptide. 
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In a further preferred embodiment, the vector is a gene activation vector. The gene 
activation technology is a technology allowing the production of proteins of interest without 
introducing the gene or cDNA of Interest Into the host cell (see e.g., EP application 91 903 
051.0). For example, the gene activation technology may bypass regulatory DNA sequences 
5 set In the "off position" with regulatory DNA sequences set in the "on position" in order to 
activate the gene of Interest. A " gene activation vector" comprises appropriate signals that 
drive expression in host cells of a polynucleotide present in said host cell. 

in a further preferred emt>odiment the vector is a gene therapy vector. Expression 
vectors that may be used for gene therapy are well known in the art. Preferably, the gene 
10 therepy vector is a lentiviral derived vector, which has been shown to be very efficient in the 
transfer of genes, in particular within the CNS. Other well-established viral vectors, such as 
adenoviral derived vectors, may also be used according to the invention. 

A third aspect of the invention relates to a host cell transformed with a DNA construct 
15 according to the invention. Many host cells are suitable in accordance with the present 
invention, such as primary or established cell lines from a wide variety of eukaryotes 
including plant and animal cells. Preferably, said host cell is an eukaryotic cell. Most 
preferably, said host cell is a mammalian cell. 

For example, suitable host cells include CHO cells, COS cells, CV1 cells, mouse L 
20 cells, HT1080 cells, BHK-21 cells. HEK293 cells, NIH-3T3 ceils. LM cells, Yl cells, NSO and 
SP2/0 mouse hybridoma cells and the like, Namalwa cells, RPMI-8226 cells, Vero cells, Wl- 
38 cells, MRC-5cells or other immortalized and/or transformed cells. 

Preferably, the host cell Is a CHO cell, and more preferably a CHO-S ceil, described 
e.g. by Sholwell et al. (1982, J Biol. Chem. 257-.2974-2980). 

25 

In a fourth aspect, the invention relates to a process for the production of a 
polypeptide of interest comprising the step of transfscting a host cell with a DNA construct 
according to the invention. 

In a fifth aspect, the invention relates to a process for the produ ctlon of a polypeptide 
30 of Interest comprising the step of culturing a host cell in accordance with the invention. 

Such processes according to the invention lead to secretion of the protein of interest, 
which may be han/ested from the cell culture supematant. Depending on the intended use, 
the cell synthetizing the polypeptide may be the product of the process according to the 
invention. 

35 In a preferred embodiment, these processes further comprise the step of isolating the 

polypeptide of interest from the host cells. This step is particularly advantageous and easy to 
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carry out since the protein of Interest may simply be isolated from the cell culture 
supernatant 

These processes may be used in transient, stable, episomai or viral expression 
systems. In a preferred embodiment, the transfectlon Is stable transfection. 

In a snclh aspect, the DNA construct according to the Invention Is used for producing a 
polypeptide of Interest 

A seventh aspect of the Invention relates to fusion polypeptides comprising an IgSP- 
tPA pre-propeptlde and/or a tPA propeptide according to the invention fused to a polypeptide 
of interest 

Having now fully described this Invention, It will be appreciated by those sidlled in the art 
that the same can be performed within a wide range of equivalent parameters without departing 
from the spirit and scope of the Invention and without undue experimentaUcm. 

While this Invention has been described In connection with spedflc embodiments thereof. 
It will be understood that It Is capable of further modifications. This application is Intended to 
cover any variations, uses or adaptations of flie Invention following, In general, the principles of 
tile invention and including such departures from the present disclosure as come within known or 
customary practice within the art to which the invention pertains and as may be applied to the 
essential features hereinbefore set forth as follows In the scope of the appended claims. 

All references dted herein. Including journal articles or abstracts, published or 
unpublished patent application. Issued patents or any otiier references, are entirely incorporated 
by reference herein, including all data, tables, figures and text presented In tiie cited references. 
Additionally, tiie entire contents of ttie references cited witiiin the references cHed herein are also 
entirely incorporated by reference. 

Reference to known metiiod steps, conventional mettiods steps, known metiiods or 
conventional nrjettiods is not any way an admission that any aspect, description or embodiment of 
tiie present Invention Is disctosed. taught or suggested In the relevant art 

The foregoing description of tiie specific embodiments will so fully reveal the general 
nature of tiie Invention that otiiers can, by applying knowledge within the skill of the art (including 
tiie contents of tiie refdrences cited herein), readily modify and/or adapt for various application 
such specific embodiments, without undue experimentation, witiiout departing from the general 
concept of tiie present Invention. Therefore, such adaptations and modifications are Intended to 
be within the meaning and range of equivalente of the disctosed embodiments, based on the 
teaching and guidance presented herein. It is to be understood that the phraseology or 
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terminology herein is for the purpose of description and not of limitation, such that the 
terminology or phraseology of the present specification is to be interpreted by the skilled artisan 
in light of the teachings and guidance presented herein, in combination with the laiowledge of 
one of ordinary sidll in the art 

5 

EXAMPLES 

Example 1: Comparison between the IgSP-tPA pre-propeptide and the human growth 
hormone signal peptide, the secreted alkaline phosphatase signal peptide, 
10 the murine immunooobulin signal peptide , 

1,1. Constructions. 
1.1.1 IgSP 

The murine IgG p -heavy chain signal peptide of SEQ ID NO: 3 (IgSP) cloned as 
15 follows. Primers of Seq ID Nos. 13 to 20 were incubated with the T4 polynudeotide kinase 
(Stratagene) for 2h30 at 37^C, and heat inactivated at ZS^'C for 10 min. The treated primers 
were ligated using cycle ligation with Pfu Ugase from Strategene as recommended by the 
manufacturer in the following cycle conditions: 

- 95**Cfor1min; 

20 - 40 cycles at 95**C for 30", 57°C for 90", 70°C for 2mln 

- 70°Cfor10mln. 

The annealed ollgos were then purified with QIAquIck columns, and PGR amplified with PFU 
turbo using standard conditions with the primer SEQ ID No 13 and 17 under the following PGR 
conditions: 
25 - 95°C5min 

- 30 cycles of 95 °C for 45", 70 ''G for 45"; and 

- 70''Gfor10mln. 

The PGR product was purified and digested with Bgl-ll and BsrGI and cloned into the pGL3- 
GH-TBPI-1380 vector (Figure 2). The product was sequenced and was found to reflect the 
30 expected sequenoe. 

1.1.2 GH^SP 

The human growth hormone signal peptide of SEQ ID NO: 4 (GH_SP) was cloned as 
follows using primers of SEQ ID No. 21 to 34. The primers were annealed as described 
above. At the end of the annealing, the product was purified and re -amplified by PGR using 
35 the same conditions as above. The PGR product was re -amplified using primers of SEQ ID 
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Nos. 21 and 34. The product was then cloned Into the pGL3 -GH-TBPI380 vector at the Bgl-ll 
and BsrGI cloning sites. 
1.1.3. SEAP_SP 

The human secreted all<aline phosphatase signal peptide of SEQ ID NO: 5 (SEAP_SP) 
5 was cloned using prinners of SEQ ID No. 35 to 41. The primers were annealed as described 
above. At the end of the annealing, the product was purified and re -amplified by PGR using 
the same conditions as above. The PGR product was re -amplified using primers of SEQ ID 
Nos. 35 and 40. The product was then cloned Into the pGL3-GH-TBPI380 vector at the Bgl-ll 
and BsrGI cloning sites. 

10 1.1.4. IgSP-tPA 

The studied IgSP-tPA pre-propeptlde comprised: (I) the murine IgG jj -heavy chain 
signal peptide fused to (11) a truncated tPA propeptide that lacks the three carboxyl-temilnal 
amino adds of the native tPA propeptide. This IgSP-tPA pre-propeptlde is shown as SEQ ID 
NO: 1. This IgSP-tPA pre-propeptlde was cloned as follows. 

15 Primers of SEQ ID No. 42 to 49 were annealed as described above. At the end of the 

annealing, the product was purified and re-amplified by PGR using primers of SEQ ID 45 and 
49. The purified PGR product was then cloned into the pGL3 -GH-TBPI380 vectore at the Bgl-ll 
and BsrGI cloning sites. 

A recombinant PGR was performed for deleting three amino-add (GAR) from the 
20 Icnown tPA propeptide. The 5' end of the construct was PGR amplified with primers of SEQ ID 
Nos. 51 and 52. and the 3' end was PGR amplified with primers of SEQ ID Nos. 53 and 54. 
The full-length IgSP-tPA-TBPI constaict was then PGR amplified with primers of SEQ ID Nos. 
51 and 54 and the two recombinant products obtained from the previous PGR. 
1.1.5. Fusion of the atove leader sequences to TBPl 
25 The above leader sequences were fused to the soluble portion of the TNF receptor 

p55 protein (TBPl, SEQ ID NO: 50). The pEFI -GH-TBPI-1403 vector (Figure 3), which allows 
expression of TBPl under the Hunrian Elongation Factor 1 (EF1) promoter, was digested by 
Nco-I and Xba-i. Fragments ^coding the leader sequences were subcloned into pEF1 -GH- 
TBPI-1403 as Nco-I or as Bsa-I -Xba-I fragmente. The resulting IgSP-TBPI, GH_SP-TBPi 
30 SEAP_SP-TBPI and IgSP-tPA constructs were sequenced and were found to reflect the 
expected sequence. 
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1,2. Measurement of protein secretion 

Each constmct was transfected in CHO-DUKX-B11 cells using standard lipid mediated 
transfection as descrik)ed In standard laboratory manuals (ManlaUs et al.. Molecular Cloning: A 
5 Laboratory Manual, 2*^ edition, Cold Spring Harbor Laboratory Press). 48hrs after 
transfecHon, medium and cells were harvested. Cells were washed iwlce in PBS and the 
pellet was lysed on Ice using 300jU of Cytobuster buffer (Novagen) In the presence of a 
protease inhibitor cocktail (Roche). Cellular debries were spinned down by centrif ugation at 16 
000 g for 10 mn at 4 ''C. The supernatant was harvested and kept at -20 ""C before being 
10 processed. The amount of TBPI released in the supematant or In intracellular compartment 
was analyzed by an ELISA. The relative expression of TBPI was measured and reported. 

The results of the experiment are shown on Figure 4. The IgSp-tPA signal propeptide 
is able to boost secretion of TBPI from cells as demonstrated by the increased amount of 
TBPI detected in the supematant versus the amount of TBPI detected in intracelullar 
15 compartments. Thus the IgSP-tPA-TBPI construct, comprising TBPI fused to an IgSP-tPA 
propeptide, increases secretion of TBPI compared to the constructs corresponding to the TBPI 
protein fused to the IgSP signal peptide, to the secreted alkaline phosphatase signal peptide 
or to the growth hormone signal peptide. 

Accordingly, the IgSP-tPA pre-propepUde ensures a more efficient secretion of TBPI 
20 than any other signal peptide Hised directly to TBPI without propeptide. 

Example 2: Comparison between the laSP-tPA prenpropeptlde and the tPA pre- 
propeptide . 

25 2-t- Constructions . 

2.1.1. IgSP'tPA'TBPl 

The IgSP-tPA-TBPl construct described In 1.1.5. was digested by Xba-1. The fragment 
comprising lgSP-ti=»A-TBPl was cloned into tiie pmCMV-UbB-LUC-1433 expression vector 
(Figure 5) digested by Nco-I and Xba-I. 
30 2.1.2. tPA'TBPI 

A tPA pre-propeptide of SEQ ID NO: 2 comprising: (i) the tPA signal peptide and (ii) a 
truncated tPA propeptide that lacks the three carboxyl -terminal amino acids of the native tP A 
propeptide was generated as follows. 

The human tPA pre-propeptide was cloned using tiie IgSP-tPA-TBPI construct as a 
35 template. A first PGR was peribmied with primers of SEQ ID No 55 and 56 In order to amplify 
the tPA propeptide and the 5' end of TBPI. In a second PGR, the PGR product from the first 
step was extended by re-ampllfication witii primers of SEQ ID Nos. 57, 58 and 56. The PGR 
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product was then cloned Into the pGL3-GH-TBPI-1380 vector (Figure 2) digested with Bgl-!l 
and BsrGI. A recombinant PGR was performed as described In 1.1.4. with primers of SEQ ID 
Nos. 51 to 54 In order to introduce an Intemal deletion of three amino-acid (GAR) Into the 
known tPA propeptide. 

The resulting tPA-TBPI constaict was digested by a Bsa-I and Xba-I. The fragment 
comprising tPA-TBPI was cloned into the pmCMV-UbB-LUC-1433 expression vector digested 
Nco-I and Xba-l. 
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2.2. Measurement of protein secmtion 

OHO cells were tiansfected using lipofectamlne with the IgSP-tPA-TBPi and the tPA- 
TBPI constmcte. In one series of experiments, the TBPI expression vector was co-transfected 
with SV40neo and pure vectors for selection, in another series, the TBPI expression vector 
co-transfBcted with SV40dhfr and SV40 pure vectors. Pools of stable expressing clones 
representing at least 100 clones were expanded in different selection medium (600 ng.rrH'^ 
neomycin, 6 |ig.mr^ puromydn. or HT + 6 ng.m|-^ puromycin). Pools were spilt and cells 
seeded either In FCS-contalning or In serum-free medium. The media were harvested after 48 
hrs, and the amount of TBPI released In the supernatant was determined using an ELISA. The 
result of this experiment Is shown In Rgure 6. Each box represente a pool. Open boxes and 
dari< boxes represent two different pulses In 10% FCS-contelning medium at 37 'C. Striped or 
20 squared box represent two different pulses In serum -free medium at 32 °C. The initial number 
of seeded cells and the pulse periods were similar in each experiment 

Rgure 6 shows that in all conditions that were studied, pools of IgSP -tPA-TBPI 
expressing cells had higher titers of TBPI than pools of tPA -TBPI expressing cells. The results 
deariy indicate that the IgSP-tPA construct is at least two fold better than U^A construct in 
25 temis of quantity of secreted protein that Is produced. 

Accordingly, the novel combination of the tPA propeptide with the IgSP signal peptide 
is more efficient at promoting secretion of proteins ttian the tPA pre -propeptide. 



30 



In addition, ttie sequence of ttie N-tenninal ectremlty of the TBPI protein secreted from 
IgSP-tPA-TBPI expressing cells was determined by N -terminal sequencing using Edman 
degradation, it was found ttiat 100% of the proteins had been cleaved after the last arginine 
residue of tiie tPA. Thus ttie IgSP-tPA pre -propeptide ensures an efficient and reliabte 
processing of polypeptides. 
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Examples: Comparison between the IgSP'tPA pre-propeptlde and the Interferon 
aamma receptor signal peptide for production of Interferon gamma , 

5 3-t. Constructs . 

The IgSP pre-pro peptide was fused to a mature Interferon gamma receptor chain 
protein (IFNAR) and cloned into (i) the mCMV-UbB-LUC-1433 vector (Figure 3); or (ii) a vector 
comprising the promoter of the mCMV-IE2 gene described in EP application 03 100 617.4 . 

A full-length IFNAR, comprising the native signal peptide, was cloned into (i) the 
10 mCMV-UbB-LUC-1433 vector; or (ii) the expression vector comprising the promoter of the 
mCIVIV-IE2 gene described in EP application 03 100 617.4. 

3.2. Measurement of protein secretion 
3.2,1. Protocol 

15 Constructs were transfected into CHO cells using standard lipid mediated transfecHon 

protocols. The secreted proteins wero harvested after 48 hrs. A specific Elisa test was used to 
quantify the amount of IFNAR secreted in the supematant The transfectlons were nomialized 
with a ludferase construct oo-transfected with the IFNAR vector. A stendard luciferase assay 
was used as described In standard laboratory manuals (Maniatis et al.. Molecular Cloning: A 
20 Laboratory Manual, 2"^ edition. Cold Spring Harbor [.aboratory Press). 
3.2. f. Constructs compri^ngU)eCMV vector 
The IgSP-tPA pre-propeptide was about 2.3 fold more efficient in promoting secretion 
of the IFNAR protein in the supernatant than the native IFNAR signal peptide. 
3.2.2. Constructs comprising the promoter of the mCMV-IEl gene. 
25 The IgSP-tPA pre-propeptide was about 4.3 fold more efficient in promoting secretion 

of the IFNAR protein in the supematant than the native IFNAR signal peptide. 
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CLAIMS : 

1. A DNA construct comprising a sequence encoding an IgSP-tPA pre-propeptidc 
comprising an immunoglobulin signal peptide fused to a tissue-type piasminoger 

5 activator (tPA) propeptide. 

2. The DNA construct of claim 1, wherein said immunoglobulin signal peptide is a murine 
immunoglobulin signal peptide. 

3. The DNA construct of claim 2, wherein said murine Immunoglobulin signal peptide 
comprises SEQ ID NO: 3. 

10 4. The DNA constmct of any of claims 1 to 3, wherein said tPA propeptide is a human tP/ 

propeptide, the carboxyl-terminai extremity of said tPA propeptide consisting of amine 
acids Arg-Xaa-Arg-Arg. 

5. The DNA construct of daim 4, wherein said tPA propeptide consists of amino acids 24 tc 
32 of SEQ ID NO: 2. 

15 6. The DNA construct of any of claims 1 to 5, wherein said pre-propeptide comprises SEC 

IDNO:1. 

7. The DNA construct of any of claims 1 to 6, wherein said DNA construct encodes a fiusior 
polypeptide comprising said IgSP-tPA pie-propeptide fused to a polypeptide of interest. 

8. A DNA constmct comprising a sequence encoding a human tissue-type piasminoger 
20 activator propeptide (tPA) wherein the carboxyl-terminai extremity of said tPA propeptide 

(X>nslsts of amino acids Aig-Xaa-Arg-Arg. 

9. The DNA construct of claim 8, wherein said tPA propeptide consists of amino acids 24 tc 
32 of SEQ ID NO: 2. 

10. The DNA construc^t of (daim 8 or 9, further comprising a signal sequence fused to sai( 
25 tPA propeptide. 

11. The DNA construct of any of claims 8 to 10, wherein said DNA construct encodes s 
fusion polypeptide comprising said tPA propeptide fused to a polypeptide of interest. 

12. The DNA construct of any of claims 1 to 1 1, wherein said DNA construct is included In ; 
vector, 

30 1 3. The DNA constnjct of any of claims 1 2, wherein said vector is an expression vector. 
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14. The DNA construct of any of claims 1 2, wherein said vector is a gene activation vector. 

15. A host cell transformed with the DNA construct of any of claims 1 to 14. 

16. The host cell of claim 15, wherein said cell is selected from the group consisting of 
CHO cell, a COS cell, a CV1 cell, a mouse L cell, a HT1080 cell, a BHK cell, a HEK29 
cell, . a NIH-3T3 cell, a LM cell and a Yl cell. NSO and SP2/0 mouse hybridoma and thi 
like.Namalwa, RPMI-8226, Vero. WI-38. MRC-5 and the like. 

17. The host cell of claim 16. wherein said cell is a CHO cell. 

18. A process for the production of a polypeptide of interest comprising the step c 
transfecting a host cell with the DNA construct of any of claims 1 to 14. 

19. A process for the production of a polypeptide of interest comprising the step of culturini 
the host ceil of any of claims 15 to 17. 

20. The process of claim 18 or 19, further comprising the step of isolating the polypeptide o 
interest from said host cells. 

21. The process of any of claims 18 to 20, wherein the transfecHon is stable transfecUon. 

22. Use of the DNA construct of any of claims 1 to 14 for producing a polypeptide of interest 

23. A fusion polypeptide encoded by the DNA constructs of daim 7 or 1 1 . 
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ABSTRACT 

This invention encompasses novel leader sequences for production of proteins. Mor€ 
specifically, the invention relates to DMA constructs encoding leader sequences comprising ar 
5 immunoglobulin signal peptide fused to a tissue-type plasminogen activator propeptide, and tc 
DNA constructs encoding leader sequences comprising a truncated human tissue-typ€ 
plasminogen activator propeptide. The invention further relates to the use of these DNA 
constructs for producing proteins in mammalian cells. 
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SEQUENCE LISTING 

<110> Applied Research Systems ARS Holding N.V. 

<120> Novel leader sequences for use in production of proteins 

<130> 884 EP 

<160> 58 

<170> Patentin version 3.1 

<210> 1 

<211> 29 

<212> PRT 

<213> Artificial 

<220> 

<221> source . 

<223> /note="Description of artificial sequence: IgSP-tPA pre-propeptide 

<220> 

<221> SIGNAL 
<222> (1) . - (20) 

<223> murine immunoglobulin signal peptide 



<220> 

<221> PROPEP 

<222> (21) . . (29) 

<223> human tissue plasminogen activator propeptide 



<400> 1 

Met Lys Cys Ser Trp Val lie Phe Phe Leu Met Ala Val Val Thr Gly 
1 5 10 15 



Val Asn Ser Ser Gin Glu lie His Ala Arg Phe Arg Arg 
20 25 



<210> 2 

<211> 35 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> SIGN2VL 

<222> (1) • . (23) 
<223> 



<220> 

<221> PROPEP 
<222> (24) . . (35) 
5 <223> 

<400> 2 

Met Asp Ala Met Lys Axg Gly Leu Cys Cys Val Leu Leu Leu Cys Gly 
10 1 5 10 15 
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Ala Val Phe Val Ser Pro Ser Gin Glu He His Ala Arg Phe Arg Arg 
20 25 30 



Gly Ala Arg 
35 



20 



25 



30 



35 



<210> 
<211> 
<212> 
<213> 

<220> 
<221> 
<222> 
<223> 
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19 
PRT 

Mus musculus 



SIGNAL 
(1) . - (19) 



<400> 3 

Met Lys Cys Ser Trp Val He Phe Phe Leu Met Ala Val Val Thr Gly 
15 10 15 



Val Asn Ser 



45 



<210> 


4 


<211> 


26 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<221> 


SIGNAL 


<222> 


(1) . . (26) 


<223> 




<400> 


4 



Met Ala Thr Gly Ser Arg Thr Ser Leu Leu Leu Ala Phe Gly Leu Leu 
15 10 15 



5 Cys Leu Pro Trp Leu Gin Glu Gly Ser Ala 
20 25 



<210> 5 

10 <211> 17 

<212> PRT 

<213> Homo sapiens 

<220> 

15 <221> SIGNAL 

<222> (1) . . (17) 
<223> 



20 



<400> 5 



Met Leu Leu Leu Leu Leu Leu Leu Gly Leu Arg Leu Gin Leu Ser Leu 
15 10 15 



25 Gly 



<210> 6 

30 <211> 87 

<212> DNA 

<213> Artificial 



<220> 

35 <221> source , -r ot» 4.«iv 

<223> /note="Description of artificial sequence: nucleic acxd encoding igSP-tPA 

pre -propeptide " 



<400> 6 ^ ^a. 

40 atgaagtgca gctgggtgat cttcttcctg atggccgtgg tgaccggcgt gaattccagc 
caggagatcc acgccaggtt ccgcaga 

45 <210> 7 

<211> 105 

<212> DNA 

<213> Homo sapiens 

50 <400> 7 

atggacgcca tgaagcgcgg cctgtgctgc gtgctgctgc tgtgcggcgc cgtgttcgtg 



60 
87 



60 



agccccagcc aggagatcca cgccaggttc cgcagaggcg ccaga 



105 



<210> 8 

5 <211> 57 

<212> DNA 

<213> Mus musculus 

<400> 8 

10 atgaagtgca gctgggtgat cttcttcctg atggccgtgg tgaccggcgt gaattcc 57 

<210> 9 

<211> 161 

15 <212> PRT 

<213> Homo sapiens 

<400> 9 

20 Asp Ser Val Cys Pro Gin Gly Lys Tyr lie His Pro Gin Asn Asn Ser 
15 10 15 

lie Cys Cys Thr Lys Cys His Lys Gly Thr Tyr Leu Tyr Asn Asp Cvs 
25 20 25 30 

Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg Glu Cys Glu Ser Gly Ser 
35 40 45 

30 

Phe Thr Ala Ser Glu Asn His Leu Arg His Cys Leu Ser Cys Ser Lys 
50 55 60 

35 

Cys Arg Lys Glu Met Gly Gin Val Glu He Ser Ser Cys Thr Val Asp 
65 70 75 80 

40 Arg Asp Thr Val Cys Gly Cys Arg Lys Asn Gin Tyr Arg His Tyr Trp 

85 90 95 

Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys Ser Leu Cys Leu Asn Gly 
45 100 105 110 

Thr Val His Leu Ser Cys Gin Glu Lys Gin Asn Thr Val Cys Thr Cys 
115 120 125 

50 



His Ala Gly Phe Phe Leu Arg Glu Asn Glu Cys Val Ser Cys Ser Asn 



130 135 140 

Cvs Lys Lys Ser Leu Glu Cys Thr Lys Leu Cys Leu Pro Gin He Glu 
145 150 155 160 



Asn 



<210> 10 

<211> 239 

<212> PRT 

<213> Homo sapiens 

<400> 10 

Met Leu Leu Ser Gin Asn Ala Phe He Val Arg Ser Leu Asn Leu Val 
1 5 10 • 15 

Leu Met Val Tyr ILe Ser Leu Val Phe Gly He Ser Tyr Asp Ser Pro 
20 25 30 

Asp Tyr Thr Asp Glu Ser Cys Thr Phe Lys He Ser Leu Arg Asn Phe 
35 40 45 

Arg Ser He Leu Ser Trp Glu Leu Lys Asn His Ser He Val Pro Thr 
50 55 60 

His Tvr Thr Leu Leu Tyr Thr He Met Ser Lys Pro Glu Asp Leu Lys 
65 70 75 80 

Val Val Lys Asn Cys Ala Asn Thr Thr Arg Ser Phe Cys Asp Leu Thr 
85 90 95 

Asp Glu Trp Arg Ser Thr His Glu Ala Tyr Val Thr Val Leu Glu Gly 
100 105 110 

Phe Ser Gly Asn Thr Thr Leu Phe Ser Cys Ser His Asn Phe Trp Leu 
115 120 125 



Ala He Asp Met Ser Phe Glu Pro Pro Glu Phe Glu He Val Gly Phe 
130 135 140 



Thr Asn His lie Asn Val Met Val Lys Phe Pro Ser He Val Glu Glu 
^ 145 150 155 160 

Glu Leu Gin Phe Asp Leu Ser Leu Val He Glu Glu Gin Ser Glu Gly 
165 170 175 

10 

He Val Lys Lys His Lys Pro Glu He Lys Gly Asn Met Ser Gly Asn 
180 185 190 

15 Phe Thr Tyr He He Asp Lys Leu He Pro Asn Thr Asn Tyr Cys Val 
195 200 205 

Ser Val Tyr Leu Glu His Ser Asp Glu Gin Ala Val He Lys Ser Pro 
20 210 215 220 

Leu Lys Cys Thr Leu Leu Pro Pro Gly Gin Glu Ser Glu Phe Ser 
225 230 235 

25 

<210> 11 

<211> 34 

<212> DNA 

30 <213> Artificial 

<220> 

<221> source 

35 "^^^-^^ /note=="Description of artificial sequence: primer" 
<400> 11 

tgctctagag cgtcacccct agagtcgagc tgtg 34 



40 <210> 12 

<211> 59 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note=^"Description of artificial sequence: primer" 

<400> 12 

50 ggcgttgagc ggccgcggtt catgacgcta gcaccgaatt cacccctgta accactgcc 



<210> 13 

<211> 51 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note«"Description of artificial sequence: primer" 

10 <400> 13 

cgcagatctg gtctcgcatg aagtgcagct gggtgatctt cttcctgatg g 



<210> 14 

15 <211> 43 

<212> DNA 

<213> Artificial 

<220> 

20 <221> source 

<223> /note=" Description of artificial sequence: primer" 

<400> 14 

ccgtggtgac cggcgtgaat tccgacagcg tgtgccctca ggg 

25 

<210> 15 

<211> 52 

<212> DNA 

30 <213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

35 

<400> 15 

caagtacatc caccctcaga acaacagcat ctgctgcacc aagtgccaca ag 



40 <210> 16 

<211> 26 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 16 
50 ggcacctacc tgtacaacga ctgccc 



<210> 17 

<211> 51 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note=" Description of artificial sec[uence: primer" 

10 <400> 17 

gggcagtcgt tgtacaggta ggtgcccttg tggcacttgg tgcagcagat g 51 



<210> 18 

15 <211> 48 

<212> DNA 

<213> Artificial 



<220> 
20 <221> source 

<223> /note="Description of artificial sequence: prdLmer" 



25 



<400> 18 

ctgttgttct gagggtggat gtacttgccc tgagggcaca cgctgtcg 48 



<210> 19 

<211> 48 

<212> DNA 

30 <213> Artificial 



<220> 

<221> source 

<223> /no te=" Description of artificial sequence: primer" 
<400> 19 

gaattcacgc cggtcaccac ggccatcagg aagaagatca cccagctg 



40 <210> 20 

<211> 25 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note=="Description of artificial sequence: primer" 

<400> 20 
50 cacttcatgc gagaccagat ctgcg 



<210> 21 

<211> 47 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

10 <400> 21 

ccacaccatg gccaccggca gccgcaccag cctgctgctg gccttcg 



<210> 22 

15 <211> 50 

<212> DNA 

<213> Artificial 

<220> 

20 <221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 22 

gcctgctgtg cctgccctgg ctgcaggagg gcagcgccga cagcgtgtgc 

25 

<210> 23 

<211> 53 

<212> DNA 

30 <213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

35 

<400> 23 

cctcagggca agtacatcca ccctcagaac aacagcatct gctgcaccaa gtg 



40 <210> 24 

<211> 52 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 



<400> 24 

50 ccacaagggc acctacctgt acaacgactg ccctggccct ggccaggaca cc 



<210> 25 

<211> 52 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 
<400> 25 

gactgccgcg agtgcgagag cggcagcttc accgccagcg agaaccacct gc 

<210> 26 

<211> 55 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note-"Description of artificial sequence: primer" 
<400> 26 

gccactgcct gagctgcagc aagtgccgca aggagatggg ccaggtggag atcag 

<210> 27 

<211> 25 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 27 

cagctgcacc gtggaccgcg acacc 

<210> 28 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 
<400> 28 

ggtgtcgcgg tccacggtgc agctgctgat ctccacctgg cccatctcct tgc 



<210> 29 

<211> 53 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 

10 <400> 29 

ggcacttgct gcagctcagg cagtggcgca ggtggttctc gctggcggtg aag 



<210> 30 

15 <211> 53 

<212> DNA 

<213> Artificial 

<220> 

20 <221> source 

<223> /note=="Description of artificial sequence: primer" 

<400> 30 

ctgccgctct cgcactcgcg gcagtcggtg tcctggccag ggccagggca gtc 

25 

<210> 31 

<211> 56 

<212> DNA 

30 <213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

35 

<400> 31 

gttgtacagg taggtgccct tgtggcactt ggtgcagcag atgctgttgt tctgag 



40 <210> 32 

<211> 49 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 32 

50 ggtggatgta cttgccctga gggcacacgc tgtcggcgct gccctcctg 



<210> 33 

<211> 47 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note=="Description of artificial sequence: primer" 

10 <400> 33 

cagccagggc aggcacagca ggccgaaggc cagcagcagg ctggtgc 47 



<210> 34 

15 <211> 23 

<212> DNA 

<213> Artificial 



<220> 
20 <221> source 

<223> /note="Description of artificial sequence: primer" 



25 



<400> 34 

ggctgccggt ggccatggtg tgg 23 



<210> 35 

<211> 49 

<212> DNA 

30 <213> Artificial 



<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 
<400> 35 

gctcgagatc tggtctcgca tgctgctgct gctgctgctg ctgggcctg 49 



40 <210> 36 

<211> 41 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 

<400> 36 

50 aggctccagc tctccctggg cgacagcgtg tgccctcagg g 



<210> 37 

<211> 43 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note«" Description of artificial sequence: primer" 

<400> 37 

gcccagggag agctggagcc tcaggcccag cagcagcagc age 



<210> 38 

<211> 52 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 

<400> 38 

caagtacatc caccctcaga acaacagcat ctgctgcacc aagtgccaca ag 



<210> 39 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 39 

ggcacctacc tgtacaacga ctgccc 



<210> 40 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 40 

gggcagtcgt tgtacaggta ggtgcccttg tggcacttgg tgcagcagat g 



<210> 41 

<211> 47 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

10 <400> 41 

ctgttgttct gagggtggat gtacttgccc tgagggcaca cgctgtc 47 



<210> 42 

15 <211> 51 

<212> DNA 

<213> Artificial 

<220> 

20 <221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 42 

cgcagatctg gtctcgcatg aagtgcagct gggtgatctt cttcctgatg g 51 



<210> 43 

<211> 52 

<212> DNA 

30 <213> Artificial 



<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 
<400> 43 

caagtacatc caccctcaga acaacagcat ctgctgcacc aagtgccaca ag 52 



40 <210> 44 

<211> 26 

<212> DNA 

<213> Artificial 



45 <220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 



<400> 44 

50 ggcacctacc tgtacaacga ctgccc 26 



<210> 45 

<211> 51 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 

10 <400> 45 

gggcagtcgt tgtacaggta ggtgcccttg tggcacttgg tgcagcagat g 



<21G> 46 

15 <211> 47 

<212> DNA 

<213> T^tificial 



<220> 

20 <221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 46 

ctgttgttct gagggtggat gtacttgccc tgagggcaca cgctgtc 

25 

<210> 47 

<211> 40 

<212> DNA 

30 <213> Artificial 



<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

35 

<400> 47 

tctggcgcct ctgcggaacc tggcgtggat ctcctggctg 



40 <210> 48 

<211> 48 

<212> DNA 

<213> Artificial 

45 <220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 48 

50 gaattcacgc cggtcaccac ggccatcagg aagaagatca cccagctg 



<210> 49 

<211> 43 

<212> DNA 

<213> Artificial 

5 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 



10 


<400> 49 
ccgtggtgac 


cggcgtgaat 


tccagccagg 


agatccacgc 


cag 




43 


15 


<210> 50 

<211> 486 

<212> DNA 

<213> Homo sapiens 












20 


<400> 50 
gacagcgtgt 


gccctcaggg 


caagtacatc 


caccctcaga 


acaacagcat 


ctgctgcacc 


60 




aagtgccaca 


agggcaccta 


cctgtacaac 


gactgccctg 


gccctggcca 


ggacaccgac 


120 


25 


tgccgcgagt 


gcgagagcgg 


cagcttcacc 


gccagcgaga 


accacctgcg 


ccactgcctg 


180 


agctgcagca 


agtgccgcaa 


ggagatgggc 


caggtggaga 


tcagcagctg caccgtggac 


240 




cgcgacaccg 


tgtgcggctg 


ccgcaagaac 


cagtaccgcc 


actactggag 


cgagaacctg 


300 


30 


ttccagtgct 


tcaactgcag 


cctgtgcctg 


aacggcaccg 


tgcacctgag 


ctgccaggag 


360 




aagcagaaca 


ccgtgtgcac 


ctgccacgcc 


ggcttcttcc 


tgcgcgagaa 


cgagtgcgtg 


420 


35 


agctgcagca 


actgcaagaa 


gagcctggag 


tgcaccaagc 


tgtgcctgcc 


tcagatcgag 


480 


aactaa 












486 



<210> 51 

40 <211> 19 

<212> DNA 

<213> Artificial 

<220> 

45 <221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 51 

ctagcaaaat aggctgtcc 

50 



<210> 52 



<211> 34 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 52 

gttccgcaga gacagcgtgt gccctcaggg caag 



<210> 53 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 53 

cacgctgtct ctgcggaacc tggcgtggat c 



<210> 54 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 

<400> 54 

ctgccgctct cgcactcgcg gcagtcggtg tcctggccag ggccagggca gtc 



<210> 55 

<211> 53 

<212> DNA 

<213> Artificial 

<220> 

<221> source 

<223> /note="Description of artificial sequence: primer" 

<400> 55 

ctgctgctgt gcggcgccgt gttcgtgagc cccagccagg agatccacgc cag 



<210> 56 



<211> 
<212> 
<213> 



53 
DNA 

Artificial 



5 



<220> 

<221> source 
<223> /note=" 



Description of artificial sequence: primer" 



<400> 56 

ctgccgctct cgcactcgcg gcagtcggtg tcctggccag ggccagggca gtc 



53 



<210> 57 

<211> 57 

15 <212> DNA 

<213> Artificial 

<220> 

<221> source 

20 <223> /note="Description of artificial sequence: primer" 

<400> 57 

gggctcacga acacggcgcc gcacagcagc agcacgcagc acaggccgcg cttcatg 57 



25 



<210> 
<211> 
<212> 
<213> 



Artificial 



58 
30 
DNA 



30 



<220> 

<221> source 

<223> /note=" Description of artificial sequence: primer" 



35 



<400> 58 

gccaccatgg acgccatgaa gcgcggcctg 



30 
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